How robust is a quantum gate in the presence of noise? 
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We define several quantitative measures of the robustness of a quantum gate against noise. Ex- 
act analytic expressions for the robustness against depolarizing noise are obtained for all unitary 
quantum gates, and it is found that the controlled-NOT is the most robust two-qubit quantum gate, 
in the sense that it is the quantum gate which can tolerate the most depolarizing noise and still 
generate entanglement. Our results enable us to place several analytic upper bounds on the value 
of the threshold for quantum computation, with the best bound in the most pessimistic error model 
being p t h < 0.5. 
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I. INTRODUCTION 

An ideal quantum computer is usually described as 
a sequence of unitary quantum gates applied to the qubits 
making up the computer. A typical universal set of quan- 
tum gates is the controlled-NOT gate, and single-qubit 
unitary operations Q. A crucial element in a universal 
gate set is that it be capable of generating entanglement 
between the qubits making up the computer. 

In the real world quantum gates suffer from noise 3], 
which can inhibit the creation of entanglement. This 
problem led to the development of fault-tolerant meth- 
ods for quantum computation (see the discussion and 
references in 0) based on quantum error-correcting 
codes 0, One of the outstanding achievements of 
work on fault-tolerance is the threshold theorem for quan- 
tum computation 0,0, H> IU 113 ■ The threshold theorem 
states that, under reasonable physical assumptions about 
noise in the computer, it is possible to correct for the ef- 
fects of that noise, provided the strength of the noise is 
below some constant threshold, pth- (Roughly speaking, 
Pth can be thought of as the maximal probability of error 
during a single quantum gate that can be corrected using 
the methods of fault-tolerance.) The exact value of the 
threshold depends on what assumptions are made about 
the noise in the quantum computer, and estimates of the 
value of the threshold therefore vary quite a bit. Typical 
current estimates place it in the range 10 -4 to 10~ 6 . 

Motivated by the practical problem of noise, and the 
theory of fault-tolerant quantum computation, in this pa- 
per we consider the problem of quantifying how robust a 
quantum gate is to the effects of noise. More precisely, 
for a given gate U we attempt to quantify how much 
noise the gate can tolerate while preserving the ability to 
generate entanglement. Since, in a sense we make precise 
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below, entanglement generation is necessary for quantum 
computation to be possible, even if the methods of fault- 
tolerant computation are used, this program allows us to 
determine upper bounds on the value of the threshold. 

Our work is different from most other work on es- 
timating thresholds, which usually aims to determine 
lower bounds. The interest in lower bounds stems from 
their more immediate practical interest: if we know that 
Pth > 10 -6 , for example, then that gives experimental- 
ists a target to shoot for in pursuit of a working quan- 
tum computer. Nonetheless, as emphasized in from 
a fundamental point of view it would be extremely in- 
teresting to have exact values for the threshold, and this 
requires techniques for obtaining upper bounds. 

Our work is based upon the results of Vidal and Tar- 
rach 01 j who investigated the robustness of entangled 
quantum states, that is, how much noise can be added 
to a quantum state before it becomes unentangled, i.e., 
separable. Our work also naturally extends and comple- 
ments the work of Aharonov and Ben-Or who, to 
our knowledge, have done the only prior work obtaining 
upper bounds on the value of the threshold. 

Another interesting context in which our measures of 
gate robustness may be placed is the program of defin- 
ing "dynamic strength measures" for quantum dynamical 
operations [13j . Dynamic strength measures quantify the 
intrinsic power or strength of a quantum dynamical op- 
eration as a physical resource, much as an entanglement 
measure quantifies the entanglement in a quantum state. 
[l3| developed a framework for the analysis of dynamic 
strength measures, and we will see that gate robustness 
can be regarded as a measure of dynamic strength, and 
analyzed within this framework. 

The structure of the paper is as follows. Sec. [D] re- 
views background material on the Schmidt decomposi- 
tion for operators. This decomposition is central to our 
later work on the robustness of quantum gates. Sec. IIIII 
reviews the notion of separable quantum gates, which 
may be defined as the class of gates that cannot gener- 
ate entanglement in a quantum computer. Furthermore, 
this section proves that a quantum circuit containing only 
separable gates can be efficiently simulated on a classical 
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computer. Sec. IIVI reviews Vidal and Tarrach's work on 
the robustness of quantum states. This section also in- 
troduces a novel measure of the robustness of quantum 
states useful in our later work on gate robustness, and 
proves some elementary properties of the new measure. 
Sec. Ivl gives our definitions and results on the robustness 
of quantum gates, and relates the results to the theory of 
fault-tolerant quantum computation. Sec. IVII concludes. 

II. THE OPERATOR-SCHMIDT 
DECOMPOSITION 

The operator- Schmidt decomposition is an operator 
analogue of the well-known Schmidt decomposition for 
pure quantum states The present treatment of the 
operator-Schmidt decomposition is based on the discus- 
sion in 0, 0| , with the addition of a result on the con- 
tinuity of the Schmidt coefficients of a unitary operator. 

We begin by introducing the Hilbert-Schmidt inner 
product on d x d operators, (Q,P) = tr(Q' l 'P), for any 
operators Q and P. We define an orthonormal opera- 
tor basis to be a set {Qj} which satisfies the condition 
(Qj,Qk) — tr(Q^Qfe) = 5jk- For example, an orthonor- 
mal basis for the space of single-qubit operators is the 
set {I/V2, X/V2, Y/V2, Z/V2}, where X, Y, and Z are 
the Pauli sigma operators, and I is the identity. 

The operator-Schmidt decomposition states that any 
operator Q acting on systems A and B may be writ- 
ten Q: 

Q = ^qiAi®B h (1) 
i 

where qi > 0, and Ai and Bi are orthonormal oper- 
ator bases for A and B, respectively. To prove the 
operator-Schmidt decomposition, expand Q in the form 
Q = Sjfe MjkCj <g> Dk, where Cj and Dk are fixed or- 
thonormal operator bases for A and B, respectively, and 
Mjk are complex coefficients. The singular value decom- 
position states that the matrix M with (j, fc)th entry Mjk 
may be written M = U qV, where U and V are unitary 
matrices and q is a diagonal matrix with non-negative 
entries. We thus obtain 

Q = J2 U i lC l lVlkC 3® Dk > W 
jki 

where qi is the Ith diagonal entry of q. Defining orthonor- 
mal operator bases A\ = J^j UjiCj and Bi = J^k VikDk, 
we obtain the operator-Schmidt decomposition, Eq. Q. 

To better understand the coefficients qi in the operator- 
Schmidt decomposition, imagine that associated with 
each system, A and B, there are reference systems, Ra 
and Rbi with the same state space dimensionalities, dA 
and ds, as A and B. Let 

l a ) = ^2\jRA3A)/\/dA~, and (3) 

j 



\0) = Y,\iB3R B )/Vd^ (4) 
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denote normalized, maximally entangled states of RaA 
and BRb, respectively. Now let £ be a general quantum 
operation [SBj ; we will shortly specialize to the case when 
£ corresponds to the action of U. We define p{£) to 
be the density operator resulting when £ acts on 
Writing this out explicitly, with subscripts to make it 
clear which operations are acting on which systems: 

p{£) = {l RA ®£ AB ®l RB )o{\a){a\®\(3){(3\), (5) 

where Xs denotes the identity quantum operation on a 
system S. In the special case when £ represents a unitary 
operation, U, on AB, we define ip(U) to be the quantum 
state obtained when U acts on |a)|/3), and let p(U) be 
the corresponding density operator. Note that we will 
interchange notations like ip(U) and \ip(U)), depending 
on which is more convenient in a particular context. 

The Schmidt coefficients of i/j(U) are closely connected 
to the operator-Schmidt coefficients of U, which we de- 
note Uj. Letting U = ^2jUjAj <g) Bj be an operator- 
Schmidt decomposition, we see that 

HU) = (I RA ®U®I RB )\a)\f3) (6) 
= Y, u ii I RA®Aj)\a){Bj®I RB m. (7) 

3 

Direct calculation shows that \/11a(I Ra ® Aj)\a) and 
\fd~B~(Bj <g) I Rb ) form orthonormal bases for RaA and 
BRb, respectively. Thus, we obtain the useful result 
that the quantum state ip(U) has Schmidt coefficients 
Ujj \fdA~d~B equal, up to the factor 1/v 'd A dB, to the 
Schmidt coefficients of U. 

The following proposition shows that the Schmidt co- 
efficients of U are continuous functions of U. In the state- 
ment of the proposition, ||M|| = maxi|^ii = i ||M|^)|| de- 
notes the usual operator norm. 

Proposition 1 Let U and V be operators on AB, with 
respective Schmidt coefficients Uj and Vj, ordered into 
decreasing order, u\ > u-i > and v\ > Vi > .... 
Then 

■>(\ H , ",' r ') <ll^-^|| 2 (8) 

To understand why Eq. (jSJl can be interpreted as 
a statement about continuity requires a little thought. 
Note that tr(C/ t {7) = tr(V^V) = d A d B , and thus 
Y^j u "j = Ylj v j = It follows that we can think 

of Uj/dAdB and v^/dAds as probability distributions. 
With this interpretation, the quantity 'Y^jiijVj / dAds is 
just the fidelity of these two probability distributions, 
and it follows from Eq. (JSJ that if U V then Uj ra Vj 
for all j. 

Proof: The key is to observe that the norm || • || is 
stable when extended trivially to an ancilla system, i.e., 
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|| M | = || M (£> I\\. Using this observation we have 

\\U-V\\ - \\Ir*®(U-V)®Ir b \\ (9) 
> \\(lRA®(U-V)®lR B )\a)\0)\\ (10) 
= \\^(U)-^(V)\\. (11) 

Squaring both sides of the inequality, and interchanging 
the roles of the two sides, we obtain: 

U(U)\\ 2 + H(V)\\ 2 -2Re((i;(U)\m))) <W~ V\\ 2 . 

(12) 

Since ||^(^)l| 2 = ||^(V)|| 2 = 1, this implies 

2(1-MU)MV))\)<W-V\\ 2 . (13) 

Since ip(U) and ip(V) have Schmidt coefficients 
Uj/y/dAds and Vj/y/d^B. respectively, it follows 
from the results of [H 03 that \{^{U)\i>{V))\ < 
J2j UjVj /dAds- Combining this inequality with Eq. i|13|) 
gives the desired result. □ 

III. SEPARABLE AND 
SEPARABILITY-PRESERVING QUANTUM 
GATES 

We now formally introduce the notion of separable 
quantum gates, and study their basic properties, in 
Sec. IIII Al Sec. IfflBl states and proves a theorem show- 
ing that quantum circuits built entirely out of separable 
quantum gates can be efficiently simulated on a classi- 
cal computer. Finally, Sec. IIII CI notes that the classi- 
cal simulation theorem of the previous subsection can 
be extended to a somewhat larger class of gates, the 
"separability-preserving" gates, and considers some of 
the implications of this fact. 

A. Definition and basic properties 

Suppose £ is a quantum operation acting on a com- 
posite quantum system with two components labeled A 
and B. £ is said to be separable if it can be given an 
operator-sum representation of the form 

£{p) = Y,{A j ®B j )p{A]®B]) (14) 

3 

Separable quantum operations were independently intro- 
duced in (ItI ITsf . where it was speculated that trace- 
preserving separable quantum operations might corre- 
spond to the class of quantum operations that can be 
implemented on a bipartite system using local opera- 
tions and classical communication. This speculation was 
false [l^. However, a related conjecture is true, namely, 
that trace-preserving separable quantum operations cor- 
respond to the class of trace-preserving quantum opera- 
tions which cannot be used to generate quantum entan- 
glement. This follows from an elegant characterization 



theorem of Cirac et al [20] linking separability of a quan- 
tum operation £ to separability of the quantum state 
p{£) introduced in Eq. JSJ. 

Theorem 1 (Operation-separability theorem [20]) 

A trace-preserving quantum operation £ is separable if 
and only if p{£) is a separable quantum state, that is, 
p{£) can be written in the form 

P {£) = Y.pipf AA ®p! RB ' ( 15 ) 

where the pj are probabilities, pf AA are quantum states 
of system RaA, and pf RB are quantum states of system 
BRb- 

When we say in the statement of the theorem that 
p{£) is separable there is initially some ambiguity, due 
to the multiple ways the system RaABRb can be de- 
composed into subsystems. To avoid this ambiguity, it 
is convenient to introduce notational conventions as fol- 
lows. Let a be a state of a composite system CD. We 
say er is separable with respect to the C : D cut if a can 
be written a = J^jPjPf ® pf for probabilities pj, and 
quantum states pf, p® of systems C and D, respectively. 
The advantage of this notation comes when more sys- 
tems are introduced. For example, when a is a state of a 
tripartite system, CDE, it is immediately clear what we 
mean by separability with respect to the C : DE cut, or 
with respect to the C : D : E cut, or other possible cuts. 
Thus, in the operation-separability theorem, the asser- 
tion is that £ is separable if and only if p(£) is separable 
with respect to the RaA : BRb cut. 

We have stated the operation-separability theorem for 
the case of trace-preserving quantum operations, but a 
similar result also holds for non trace-preserving quan- 
tum operations £. The only change is that the pj are no 
longer probabilities, but instead can be any set of non- 
negative real numbers. We have also restricted our atten- 
tion to bipartite quantum operations, that is, £ which act 
on quantum systems with just two components, A and 
B. It is not difficult to show that an analogous statement 
holds for fc-party quantum operations £. We do this by 
endowing each party with an associated reference system 
with which it is initially maximally entangled, and defin- 
ing p(£) to be the result of allowing £ to act on this initial 
state. £ is then separable if and only if p(£) is separable. 

An interesting corollary of the operation-separability 
theorem is that a quantum operation is separable if and 
only if it is incapable of producing entangled states. Fur- 
thermore, by connecting gate separability to state separa- 
bility, the operation-separability theorem allows us to ap- 
ply results from the theory of state separability to prove 
that certain gates are separable, and thus incapable of 
producing entanglement. 

The operation-separability theorem tells us that a 
trace-preserving quantum operation £ is separable pre- 
cisely when p{£) is separable. However, it does not follow 
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that all separable states of RaA : BRb can be written as 
p(£) for some trace-preserving quantum operation. To 
understand why this is the case, observe that when £ 
is trace-preserving, tr ab{p(£)) must be the completely 
mixed state of RaRb- In general, however, it is not dif- 
ficult to find separable states a of RaA : BRb such that 
tr ab(p) is not completely mixed. 

An elegant result of M., P., and R. Horodecki [2l| can 
be used to characterize precisely which separable states 
can be written in the form p(£) for trace-preserving, sep- 
arable £. Their result, which we have restated in the 
context of multipartite systems, is as follows: 

Theorem 2 The set of density matrices, a, of RaABRb 
such that a = p{£) for some trace-preserving quantum 
operation £ is precisely the set such that trAs(c) is the 
completely mixed state of RaRb- 

Combining this theorem with the operation- 
separability theorem we obtain the following result: 

Theorem 3 The set of density matrices, a, of RaABRb 
such that a — p(£) for some trace-preserving and separa- 
ble quantum operation £ is precisely the set such that (a) 
a is separable with respect to the RaA : BRb cut; and 
(b) trAB{<?) is the completely mixed state of RaRb- 



B. Separable gates and quantum computation 

Having discussed the basic properties of separable 
quantum operations, we now turn to their utility for 
quantum computation. Imagine a quantum circuit is 
built entirely out of separable quantum gates and single- 
qubit gates. It is intuitively plausible that such a quan- 
tum circuit can be efficiently simulated on a classical 
computer, and we now prove this result. The major tech- 
nical difficulty is issues involving the accuracy required 
in the simulation, and the associated computational over- 
head. 

Our model of quantum computation is as follows. Let 
Q be a fixed set of one- and two-qubit quantum gates. 
By "quantum gate" we mean a trace-preserving quan- 
tum operation. We assume that all the two-qubit gates 
in Q are separable. We let {C n } be a uniform family of 
quantum circuits P, containing p(n) gates, and act- 
ing on q(n) qubits, where p(n) and q(n) are polynomials 
in some parameter n. The initial state of the computer 
is assumed to be a computational basis state, \x). The 
computation is concluded by performing a measurement 
in the computational basis, yielding a probability dis- 
tribution p x {y) over possible measurement outcomes y. 
The measurement may be either on all the qubits, or on 
some prespecified subset. For instance, if one is solving 
a decision problem, it is only necessary to measure the 
first qubit of the computer, to get a single zero or one as 
output. 



What does it mean to simulate this computation ef- 
ficiently on a classical computer? Suppose we have a 
classical computer that, on input of x, produces an out- 
put y with probability distribution p x (y) . A good mea- 
sure of how well this simulates the quantum computa- 
tion is provided by the L\ distance. For probability dis- 
tributions r(y) and s(y) the L\ distance is defined by 
D(r(y), s(y)) = ^2 y \r(y) — s(y)\/2. Thus, we require that 
the Li distance D(p x (y),p x (y)) = \Px(y) - Px{y)\/2 
satisfies 

D( Px (y),p x (y))<e (16) 

for some parameter e > 0. We will show that the com- 
putational resources required to achieve this accuracy 
on a classical computer scale as 0(poly(p(n)/e)), where 
poly(-) is some polynomial of fixed degree not depending 
on the circuit family {C n }. Thus, high accuracies in the 
simulation can be achieved with modest computational 
cost. 

As an example of the practical implications of this re- 
sult, suppose {C n } is a uniform family of quantum cir- 
cuits solving a decision problem, outputting the correct 
answer to an instance, x, of the decision problem with 
probability at least 3/4. Our result implies that there 
is a classical simulation using 0(poly(p(n))) gates, and 
outputting the correct solution to the decision problem 
with probability 2/3. (The probability of obtaining the 
correct answer may easily be boosted up beyond 3/4 by 
a constant number of repetitions.) 

To analyze the method described below for classical 
simulation, we need the notion of the trace distance, a 
quantum generalization of the L\ distance. The trace 
distance, D(p, a) , between density matrices p and o is 
defined by [l| D(p,a) = trjp — er|/2. Note that we use 
the same notation D(-, •) for the trace distance and the L\ 
distance, with the meaning to be determined from con- 
text. The properties of the trace distance are discussed 
in detail in [l|, and we need only a few properties here: 

• The trace distance satisfies the triangle inequality, 
D{p,T)<D{p,o)+D{a,T). 

• The trace distance is doubly convex, meaning that if 
Pj are probabilities, and pj and <7j are correspond- 
ing density matrices, then 

D I Y^PjPj'Y^PW ^ ^l'j lh l'r' T j:- (17) 
\ 3 j / j 

• The trace distance is contractive. That is, if 
£ is a trace-preserving quantum operation, then 
D(£(p),£(a)) <D(p,a). 

• The trace distance has the stability property D{p\® 
o--.pi ®a) = D(pi,p 2 ). 

• Suppose E y are POVM elements describing the 
statistics from an arbitrary quantum measurement. 
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Let r(y) = tr(pE y ) and s(y) = tr(aE y ) be the cor- 
responding probability distributions for p and a. 
Then the L\ distance and the trace distance are 
related by the inequality 



D(r(y),s{y)) < D(p,a) 



(18) 




for some constant c that does not depend on £, A or B. 
To see that this is possible, we make use of the fact that 



We now describe how the classical simulation is per- 
formed, followed by an analysis to determine the accuracy 
of the simulation. 

Variables used in the classical simulation: For 

each j = 1, . . . , q(n) we let Sj be a three-dimensional real 
vector. Each vector Sj is valid, meaning that it has the 
following three properties: (a) Each component of Sj is 
in the range [—1,1]; (b) Each component is specified to I 
bits of precision, where I is a number that will be fixed by 
the later analysis, in order to ensure the overall accuracy 
is at least e; and (c) ||sj|| < 1. 

We use the notation s = (si , . . . , s 9 ( n ) ) to denote the 
3(7(n)-dimensional real vector containing all the SjS as 
subvectors. We say that s is valid if each Sj is valid. It 
will also be convenient to introduce the notation 



£ 



I + s% ■ 3 



I 



5g(„) 



(19) 



Note that p(s) is a legitimate density operator of q(n) 
qubits, whenever s is valid. The idea of the classical 
simulation is that the variables s will be used to represent 
the state p(s). Note that p(s) is not a variable used in the 
classical simulation; it is simply a mathematical notation 
convenient in the analysis of the simulation. 

Initial state of the classical variables: Suppose 
the initial state of the quantum computer is \x), where x 
has binary expansion x\ . . . x q ( n ) . If Xj — we set Sj — 
(0,0,1) initially, while if Xj = 1 we set Sj = (0,0,-1) 
initially. 

Simulating a single-qubit gate: A single-qubit gate 
can be regarded as a two-qubit separable gate in which 
one of the qubits is acted on trivially. Thus, we need only 
consider the case of two-qubit separable gates. 

Simulating a two-qubit separable gate: Suppose 
£ is a two-qubit separable gate, and it acts on qubits 
A and B. We simulate this gate by using s as input to 
the following stochastic gate simulation procedure, which 
produces a valid 3(/(n)-dimensional vector, s', as output. 
We then set s = s', and repeat over, going through each 
gate, £%,..., £ p (w\, in the computation, until a final out- 
put value of s is produced, at which point we proceed 
to the simulation of the final measurement, as described 
below. 

Gate simulation procedure: 

Input to the procedure: A valid vector, s. 

Body of the procedure: Find valid three- vectors ff A and 
s^ B , a probability distribution, pj, containing at most 16 
elements, and with each pj specified to / bits of precision, 
such that 



D\ £ 



I + sa ■ 3 I + sb ■ 3 



I + s A ■ a I + s B ■ 3 



(21) 



is a separable, two-qubit state, and therefore, by 
Caratheodory's theorem p3| . can be written in the form 
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I + P A -3 I + P B -3 



(22) 



where the qj are probabilities, i\'^B are real-three vec- 
tors satisfying ||t^||, ||^|| < 1, and there are at most 16 
terms in the sum. Choosing the pj to be probabilities 

which are Z-bit approximations to the qj, and the s° A , s^ B 
to be valid vectors which approximate t? A , P B also to / 
bits, we obtain the result. 

Note that while Caratheodory's theorem ensures that 
such probabilities and vectors exist, finding them may 
be a non-trivial task. The obvious technique, a brute 
force search over probability distributions and valid vec- 
tors, requires poly(2 z ) operations, where poly(-) is some 
fixed polynomial function. Although we believe it likely 
that better techniques — perhaps even polynomial in I 
— are possible, for the purposes of the present simulation 
poly(2 z ) turns out to be sufficient. 

Output of the procedure: For k ^ A 7 B we define s\ = 
sj,. Set s* J = {s\, . . . , s^/ n ))- Note that s 5 is valid, by 
construction. With probability pj , output s' = tP. 

Simulating the final measurement in the com- 
putational basis: Let S be the subset of qubits that 
is measured at the output of the quantum computation. 
For each k £ S, let be the third component of s*fc. 
The measurement result for that qubit is with proba- 
bility (l + s|)/2, and 1 with probability (l-s 3 k )/2. Note 
that, by definition, p x {y) is the distribution over possible 
outcomes, y, produced by following this procedure. 

Analysis: The key to the analysis of the classical 
simulation is a simple equivalence between the classical 
simulation and certain measurements on quantum states. 
Suppose we define p m (s) to be the probability distribu- 
tion on valid vectors after m steps of the simulation pro- 
cedure, that is, after £\, . . . , £ m have been simulated. For 
m = 0, . . . ,p(n) define 



a m = J2i> m ($)p(?)- 



(23) 



It is not difficult to see that the distribution obtained by 
measuring <j p ( n ) in the computational basis of the subset 
S is exactly the same as the output distribution p x (y) 
produced by the classical simulation. 
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For m = 0, . . . ,p(n) define <j m to be the state of the 
actual quantum computer after m gates have been ap- 
plied. Thus a = |x)(a;|, a 1 = £i(er°), and so on. The 
idea of the proof that the classical simulation works well 
is to bound the distance between a m and a m . We do this 
using the following lemma. 

Lemma 1 Suppose a valid vector s is used as input to 
the gate simulation procedure with probability p(s), and 
letp(s') be the corresponding output distribution on valid 
vectors. Define 

a = £>(5)p(S) (24) 

S 

a' = (25) 

8 ' 

If the gate simulation procedure simulates the gate £, then 
we have 

D{£ (a), a') < c2~\ (26) 

where c is the constant introduced earlier in the discus- 
sion of the gate simulation procedure. 

Proof: Let p(s'\s) be the probability that s' is output 
by the gate simulation procedure, given that s is input. 
Then we have p(s') = '|s)p(s), so 

° = £^1 £?(*>>(*')• (27) 

S s' 

Applying the double convexity of the trace distance gives 

D{£{a),o>) < Y^p&D ^(p(S)),X>(*'|S)p(*')) • 

(28) 

By inspection of the construction used in the gate sim- 
ulation procedure, notably Eq. (|2UI) . and the stability 
property for trace distance, we have 

D (sip^Y^pis'l^pis'^j < c2- 1 . (29) 

Combining this observation with Eq. I|28[) gives 

D(£(a),a') < c2-\ (30) 

which was the desired result. □ 

Proposition 2 For m — Q,...,p(n), D(a m ,a m ) < 
cm2- 1 . 

Proof: We induct on m. For m — the result fol- 
lows from the fact that <t° = cf°. Assuming the result is 
true for m, we now prove it for m + 1. By the triangle 
inequality 

D(cr m+ \a m+1 ) < D(a m+1 ,£ m+1 (a m )) 

+D(£ rn+1 (a m ),a m+1 ). (31) 



By definition a m+1 — £ m +i(a m ), so this equation may 
be rewritten 

D(a m +\a m+1 ) < D(S m+1 (a m ),£ m+1 (a m )) 

+D(£ m+1 (a m ),a m+1 ). (32) 

Applying the contractivity of the trace distance to the 
first term, and Lemma[l]to the second term, we obtain 

D(a m+1 ,a m+1 ) < D(a m ,a m ) + c2~ l . (33) 

Applying the inductive hypothesis to the first term gives 

D((T rn+1 ,a m+1 ) < cm2~ l + c2~ l = c{m + 1)2"', (34) 

which completes the induction. □ 
We conclude from the proposition that 
D((T?W,^( n )) < cp(n)2~ l . It follows from Eq. fTHjl 
that the simulated distribution p x (y) and the actual 
distribution p x (y) are related by the inequality 

D( Px (y),p x (y))<cp(n)2- 1 . (35) 

Choosing I = [log 2 (cp(n)/e)] we therefore have 

D( Px (y),p x {y))<e. (36) 

The total number of times the gate simulation proce- 
dure is performed is p(n) , and the number of operations 
performed in one iteration of the gate simulation proce- 
dure scales as poly(2 z ), so the total number of operations 
in the classical simulation is 0(poly(p(n)/e)), where we 
abuse notation by letting poly(-) be a (new) polynomial 
function. We have proved the following theorem: 

Theorem 4 Let Q be a fixed set of one- and two-qubit 
gates. Suppose all two-qubit gates in Q are separable. 
Let {C n } be a uniform family of quantum circuits of size 
p(n), acting on q(n) qubits, where both p(n) and q(n) are 
polynomials. The initial state of the computer is assumed 
to be a computational basis state, \x) . The computation 
is concluded by performing a measurement in the com- 
putational basis on some prespecified subset, S , of the 
qubits, yielding a probability distribution p x {y) over pos- 
sible measurement outcomes y. Then for any e > it 
is possible to sample from a distribution p x (y) satisfying 
D{p x {y),p x {y)) < e using a classical algorithm taking 
0(poly(p(n)/e)) steps, where poly(-) is some fixed poly- 
nomial. 

Results related to Theorem 0] have been obtained in 
the past, but, so far as we have determined, no proof 
of this result has previousl y b een published. In particu- 
lar, Aharonov and Ben-Or [TlJ studied the role of entan- 
glement in quantum computation, proving that many- 
party entanglement must be present in order for a quan- 
tum computation to be difficult to simulate classically. 
This conclusion was subsequently clarified and extended 
by Jozsa and Linden [24|. However, the conclusions of 
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both and [24| are not applicable in the present con- 
text, since they apply in the context of pure state entan- 
glement of a quantum computer, rather than the mixed 
state case considered in this paper. 

The issue of mixed state quantum information process- 
ing was considered by Braunstein et al [25j, who raised, 
without answering, the question of what role mixed-state 
entanglement can play in quantum computation. This 
line of thought has been carried further by many authors, 
without completely answering the question. See [2^, |2tJ 
for recent work and further references. 



C. Separability-preserving gates 

It is straightforward to extend the proof of Theorem 0] 
in a variety of ways, without changing the conclusion that 
a classical simulation of the quantum circuit is possible. 
In particular, we can change the gates in Q so they can 
act on any bounded number of qudit systems, rather than 
two-qubit systems. 

Furthermore, the proof relies on properties of gates in 
Q that are weaker than separability. In particular, the 
gates in Q need only be separability-preserving, that is, 
£ (p) is separable for any separable state p. We denote the 
class of separability-preserving gates by SP. To see that 
this is a weaker property, note that SWAP is separability- 
preserving since it maps product states to product states, 
but SWAP is not separable, since it can generate entan- 
glement with the aid of local ancilla systems. More gen- 
erally, note that Sab is separable with respect to A : B 
if and only if £ ab ®1a'B' is separability- preserving with 
respect to A A' : BB' . 

Since the proof of Thcorcm0]only relied on the state in 
Eq. H21J) being separable, it still holds when the available 
gates are all separability-preserving. However, no sim- 
ple and easy-to-use characterization of the separability- 
preserving gates is known, which is why we prefer, for 
most of the remainder of this paper, to work with the 
separable gates. We do make occasional later use of 
separability-preserving gates, so it is convenient to note 
here a few of their properties. Note that all separable 
gates are in SP, and for gates operating on multiple qu- 
dits, any permutation of the qudits (for example swap) 
is in SP. Furthermore, SP is convex and is closed under 
composition, so 

SP D Hull {£ o V : £ separable and V a permutation} . 

(37) 

However, it is unclear whether this convex hull describes 
all of SP. For example, the operation which measures a 
pair of a qubits in the Bell basis and stores the answer in 
the computational basis (i.e. (|00) + |ll))/\/2 becomes 
|00), Cj00> — jll>)/v / 2 becomes |01), etc. ..) is certainly in 
SP though is docs not seem as though it can be expressed 
as a convex combination of £^ o Vk for separable £k and 
permutations Vk [36| . 



IV. ROBUSTNESS OF QUANTUM STATES 

To understand how robust quantum gates are to noise, 
it is useful to first review prior work on the robustness of 
entangled quantum states. This section describes Vidal 
and Tarrach's definitions and results on the robust- 
ness of quantum states, introduces a novel measure of 
robustness, and relates that measure to Vidal and Tar- 
rach's measure. The novel measure and its properties will 
be of especial interest in applications to gate robustness. 

Let p be a quantum state of a bipartite system, AB, 
and let a be a state of AB. Vidal and Tarrach [T3I de- 
fine the robustness of p relative to a, R(p\\a), to be the 
smallest non-negative number t such that the state 



is separable. Equivalcntly, we can define R(p\\o~) to be 
the smallest non-negative number t such that p + ta is 
separable; this latter definition in terms of unnormalized 
quantum states will frequently be useful in later work. 
Note that ^3] specify that a be separable; however, we 
will find it convenient to extend the definition to non- 
separable a also, specifying that R(p\\a) = +00 if no 
value of t exists such that the state in Eq. (|38|) is separa- 
ble. At first sight one is tempted to ask why we choose 
this definition for the robustness, and not the related 
quantity 

min{p : p > 0, (1 — p)p + pa is separable}. (39) 

This latter definition has a more obvious physical inter- 
pretation as the minimal probability with which a can be 
mixed with p to obtain a separable state. It follows from 
the definitions that the quantity of Eq. I|39|) is equal to 
R{p\\a)/(1 + R(p\\a)). The reason we do not work with 
the quantity of Eq. H39J) ■ despite its apparently more com- 
pelling physical interpretation, is that the robustness de- 
fined in Eq. (|38|l has useful and easy-to-prove convexity 
properties not satisfied by Eq. (j3"9"|) . namely, R(p\\a) is 
convex in both the first and the second entry. 

A special case of R(p\\a) of particular interest is the 
random robustness, defined to be the robustness of p rel- 
ative to the maximally mixed state I/dAds- We de- 
note the random robustness of a state p by R r {p) = 
R(p\\I/d A d B ). Vidal and Tarrach [l^ found a useful for- 
mula for the random robustness of a pure state ip of AB 
in terms of a Schmidt decomposition tp — iftj\j)\j) 
with ordered Schmidt coefficients ipi > ^2 > • • • > 0: 

R r {tl))=i) l i)2d A d B . (40) 

So far we have discussed the robustness of a state p 
relative to another fixed state a. We now define the ro- 
bustness of p, R(p), to be the minimum relative robust- 
ness R(p\\a) over all separable a. Thus, the robustness 
of p is a measure of how much local noise can be mixed 
with p before it becomes separable. 
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We have defined three notions of robustness for quan- 
tum states, R(p\\cr), R r (p), and R(p). All three of these 
definitions have assumed that p is a state of a bipartite 
quantum system, AB. However, robustness is easily ex- 
tended to more than two parties, and it is convenient to 
have a notation to express the extended notion. Suppose, 
for example, that p and o are states of a tripartite system 
ABC. Then R A:B:C (p\\o) is defined to be the minimal 
value of t such that p + to is separable with respect to 
A: B.C. 

Of course, a many-party quantum system can be de- 
composed in many different ways, by grouping subsys- 
tems together. So, for example, we can define a notion 
of robustness, R ABC (p\\o), when system B and C are 
grouped together. More explicitly, R ABC (p\\o) is de- 
fined to be the minimal value of t such that p + to is 
separable with respect to A : BC. 

These examples may be extended in a natural way to 
the random robustness and robustness, as well as to the 
case where more systems are present, and to more com- 
plicated groupings of subsystems. Most of our work con- 
cerns two-party robustness, and so we usually do not ex- 
plicitly include superscripts in expressions like R AB (p). 

The robustness has many useful properties, which are 
explored in detail in 0. We mention just a few of the 
more striking properties here. The robustness is invari- 
ant under local unitary operations. Moreover, it is an 
entanglement monotone, that is, cannot be increased un- 
der local operations and classical communication. It is 
also a convex function of p. As for the random robust- 
ness, Vidal and Tarrach [TJ have obtained an elegant 
formula for the robustness in the special case of a pure 
state, ip, of a bipartite system, AB, 

where ipj are the Schmidt coefficients for ip. In the course 
of their proof, Vidal and Tarrach explicitly construct a 
state, o,p, such that IV'XV'I + R(ip)&ip is separable, o^ 
may be expressed in terms of the Schmidt decomposition 

i> = Ej i>i I j) I J) by 

^ = J^Yl^l\k)(k\®m\- (42) 

In the definition of robustness we mixed p with a 
separable quantum state, o, trying to determine what 
minimal level of mixing will produce separability. An- 
other natural definition of robustness would allow o to 
range over arbitrary density matrices, not just separa- 
ble density matrices. That is, we can define R g (p) = 
min CT R(p\\o), where the g subscript indicates that we are 
minimizing globally over all possible density matrices o. 

How are R g {p) and R(p) related? It is clear from the 
definitions that R g {p) < R(p). We will prove that the 
reverse inequality is also true when p = ip is a pure state, 



so we have 

RgW = RW = ^E^j - L 

We do not know whether R g {p) = R(p) in general. To 
complete the proof of Eq. (|43|l . we show that if there 
exists a density operator o such that 

ip + to (44) 

is separable, then t > (X^^i) 2 ~~ 1- (Our proof both 
extends and simplifies a similar proof in [12j for the ro- 
bustness R(p)-) 

The proof is based on the positive partial transpose 
criterion of Peres |2^| . Let us denote the partial transpose 
on systems A and B by Ta and Tg, respectively. Then 
the positive partial transpose criterion implies that if the 
state of Eq. I|44|) is separable, then 

< %P Tb +to TB , (45) 

where < indicates an operator inequality, that is, we are 
saying that the operator on the right-hand side is posi- 
tive. 

We now use Eq. I|45|) to deduce a lower bound on t. To 
do this we introduce an operator, M, defined by M 
! — swap, where swap = J2jk ® 1^)01 i s the linear 
operator interchanging states of system A and system B. 
Note that M is positive, since swap 2 = / implies that 
SWAP has eigenvalues ±1, and thus M is a diagonalizablc 
operator with eigenvalues and 2. 

Since the trace of a product of two positive operators 
is non-negative, it follows from Eq. (|45|l that 

< tr (M^ Tb ) + 1 tr (Mo Tb ) . (46) 

Using a little algebra and the observation that for any two 
operators, K and L, tr(KL TB ) = ti(K TA L), the previous 
equation may be rewritten 

- tr (M Ta V>) < t tr (M Ta o) . (47) 

Direct calculation shows that 

M Ta = I- \a)(a\, (48) 

where \a) = ^ - \ j)\j) is the (unnormalized) maxi- 
mally entangled state. Using Eq. l|48|l it follows that 
tr (M TA if) = 1 - (J2j ipj) 2 and t r (M Ta o) < tr(o) = 1. 
Substituting these results into Eq. 147J1 gives 

which was the desired bound. 
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V. ROBUSTNESS OF QUANTUM GATES 

We now extend state robustness to quantum gates. 
Suppose £ and T are trace-preserving quantum opera- 
tions on a composite system AB. Then we define the 
robustness of £ relative to T, R{£\\J-), to be the mini- 
mum value of t such that 



1 



1 + t 



-£ 



t 



l + t 



-T 



(50) 



is separable. Equivalently, R(£\\!F) can be defined to be 
the minimal value of t such that £ + tT is separable. 
Applying the operation-separability theorem we immedi- 
ately find the useful formula 



R{£\\T) = R R - A - BR -{ P {£)\\p{T)). 



(51) 



Just as for quantum states, the notion of gate robust- 
ness extends in a natural way to systems of more than 
two parties, and we use notations analogous to those in- 
troduced earlier, such as R A:B:C (£\\ T) and R A:BC (£\\F), 
to describe this scenario. Note that these notations will 
also be extended in a natural way to the random robust- 
ness and robustness of a quantum gate, as defined below. 
As for quantum states, when identifying superscripts are 
omitted we assume that the quantum gate in question 
acts on a bipartite system, AB. 

Motivated by several different classes of noise com- 
monly occurring in physical systems, we now use the 
notion of relative gate robustness to define and study sev- 
eral different measures of robustness for quantum gates. 
First is the random robustness, which we define and study 
in Sec. IV Al Also in this subsection, we use results on 
the random robustness to place bounds on the thresh- 
old for quantum computation. Two other measures of 
robustness are the separable robustness and the global 
robustness, which we define in Sec. lVBl and use to prove 
bounds on the threshold for quantum computation. Our 
results on these measures of robustness are less complete, 
and so our discussion is more limited. 



A. Random robustness of quantum gates 

1. Definition and basic properties 

The random robustness of £, R r {£), is defined to be 
equal to the robustness of £ relative to the completely 
depolarizing channel, T>(p) = I/dAds for all states p of 
system AB: 



R r {£) = R(£\\V). 



(52) 



The random robustness is especially interesting because 
it measures the robustness of £ against complete ran- 
domization of systems A and B. Another way of stating 
this is to imagine that we are applying the operation £ 
with probability 1 — p, and randomizing the systems A 



and B with probability p. Then the threshold probabil- 
ity at which this gate crosses the separable-inseparable 
threshold is: 



R r (£) 
l+R r {£)' 



(53) 



From Eq. H51[) we see that the random robustness for an 
operation is related to the random robustness of a state 
by 



R r {£) =R RaA - BRb { p {£)). 



(54) 



Specializing to the case where £ is a unitary quantum 
operation, U, we see that R r (U) = R RaA ' BRb (p(U)). 
However, p(U) is a pure state. We showed earlier that 
p(U) has Schmidt coefficients Uj/y/dAds, where uj are 
the Schmidt coefficients of U. This observation, together 
with Eqs. I|54l) and 14UII implies the formula 



R r {U) — dAdBU\u 2 , 



(55) 



where wc order the Schmidt coefficients of U so that 
u\ > U2 > . . . > 0. (Note that in deriving this equation, 
we have replaced dA by d\, and ds by d 2 B in Eq. H4HJI. 
since we are working with robustness for the RaA : BRb 
system.) 

It is, perhaps, not immediately clear what the physical 
relevance of the random robustness is. After all, in real 
physical systems, the effects of noise on a quantum gate 
will not usually be to simply mix in some depolarization, 
together with the gate. Despite this, there is still a very 
good physical reason to be interested in the random ro- 
bustness. The reason is that, as we show in more explicit 
detail below, the random robustness can be used to ana- 
lyze the particular noise models which have been used in 
estimating bounds on the threshold for quantum compu- 
tation. In turn, it has been argued 

00I1ISII3 that by 
analyzing and correcting for the effects of noise in those 
particular models, it is possible to make general state- 
ments about a wide class of physically reasonable noise 
models. Thus, although the physical scenario considered 
in the definition of the random robustness appears rather 
specialized, it will enable insight into much more general 
physical situations. 

As an example, we may ask how robust the controlled- 
not is against the effects of depolarizing noise? 
The controlled-NOT has Schmidt decomposition ^| 
\/2|0)(0| (^//v / 2 + \/2|l)(l| ®X/V2, so Eq. © implies 
that i? r (CNOT) = 8. Interestingly, we can also show that 
the controlled-NOT is the most robust two-qubit gate. 
To see this, note that unitarity of U implies that the 
Schmidt coefficients Uj satisfy ^2jU 2 = cU^s, and thus 
u\ + u\ < dAds- It follows from this observation and 
Eq. (JSSJ that R r (U) < d 2 A d 2 B /2, and thus no two-qubit 
unitary gate can have random robustness greater than 
8, which is the random robustness of the controlled-NOT. 
These results are worth highlighting as a proposition: 

Proposition 3 For any quantum operation £, R r {£) < 
d 2 A d%/2. Ifd A =d B =2 then R r {£) < R r (CNOT) = 8. 
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The random robustness has many physically interest- 
ing properties. Below we list six easily-proved properties, 
before discussing in more depth two less easily-proved 
properties. Our discussion of these properties is, in 
part, motivated by the framework of "dynamic strength" 
measures introduced in [13j |. although the properties we 
discuss are interesting independent of that motivation. 
In 01 it was argued that these properties, especially 
the property of chaining, discussed below, are essential 
if a measure can be said to quantify the strength of a 
quantum dynamical operation as a physical resource. By 
showing that these properties are satisfied, we thus show 
that the random robustness is a good measure of dynamic 
strength. 

1. Non- negativity and locality: R r {£) > with 
equality if and only if £ is a separable quantum 
operation. 

2. Local unitary invariance: If U A , Mb, Va, Vb are 

all local unitary quantum operations, with the sys- 
tem being acted on indicated by the subscript, then 

R r {{U A ®U B ) o £ o (V A ®V B )) = R r (£). (56) 

3. Exchange symmetry: R r (£) = i? r (swAP o £ o 
swap), that is, the random robustness is not af- 
fected if we interchange the role of the systems. 

4. Time-reversal invariance: For a unitary, U, 
R r (U) = R r {U r ). 

5. Convexity: The random robustness R r {£) is con- 
vex in £. 

6. Reduction: Suppose a trace-preserving quantum 
operation £ acting on AB is obtained from a trace- 
preserving quantum operation T acting on ABC as 
follows: 

£{pab) = tr c [Hpab ® o-c)] , (57) 

for some fixed state o~c of system C. Then the 
random robustness satisfies the reduction property, 
namely, Rf B (£) < R^ BC {T). 

The random robustness satisfies two other physically 
interesting properties that are more difficult to prove. 
First of all, the random robustness is continuous in £. 
Physically, this is self-evident: making a small change in 
£ should not too drastically affect its robustness against 
the effects of noise. We now prove a quantitative form of 
this statement for unitary gates. 

Proposition 4 (Continuity of random robustness) 

Let U and V be unitary gates acting on a system A of 
dimension d A , and a system B of dimension ds- Then 

\R r (U) - Rr(V)\ <d M d\d%\\U-Vf, (58) 

where du = min(<i J 4, ds). 



Proof: Let Uj and vj be the ordered Schmidt coeffi- 
cients of U and V, respectively. From Eq. 155fl . 

\R r (U) - R r (V)\ = dAdB\uiU2 - v\v 2 \ 

= d A d B \(ui - Vi)u 2 +Vi(u 2 - v 2 )\ 

< d A d B \ui - vi\\u 2 \ + \vi\\u 2 - v 2 \ 

< d\d\ (|tti - ui| + \u 2 - v 2 \) 

< d A d 2 B J2\ Uj - vA, (59) 

3 

The second part of the proof is to observe that by the 
Cauchy-Schwartz inequality, 

3 3 

= 2d M d A d B 1 - ' , • (61) 
V d A d B J 

Applying Proposition ^ we obtain 

X>j-uj| <d M d A d B \\U-V\\ 2 (62) 

3 

Combining with Eq. (|59|l gives the result. □ 
Another physically interesting question is to ask how 
the random robustness of a gate £\o£ 2 composed of quan- 
tum gates £\ and £ 2 relates to the random robustness of 
the individual gates. The following proposition bounds 
the random robustness of the combined operation: 

Proposition 5 (Chaining for random robustness) 

Let £\ be a doubly stochastic quantum operation, that 
is, a quantum operation which is both trace-preserving 
and unital (i.e. £\{I) — I), and let £ 2 be an arbitrary 
trace-preserving quantum operation. Then 

R r {£i o £ 2 ) < RriSi) + R r {£ 2 ) + i? r (£i)i? r (£ 2 ). (63) 

Note that unitary operations are trace-preserving and 
unital, so the proposition is true when £\ and £ 2 are 
unitary. There is an equivalent way of phrasing Propo- 
sition that is physically more intuitive. Suppose we 
define 

C r (£) = \n{l + R r (£)). (64) 

Then C r (£) is monotonically related to the random ro- 
bustness of £, and thus can be thought of as carrying 
the same qualitative information about the robustness of 
the gate. Simple algebra shows that the conclusion of 
Proposition [S] may be recast in the form 

C r (£i o£ 2 )<C r {£ 1 ) + C r {£ 2 ). (65) 

The simplicity and clarity of this form may, perhaps, 
make it more useful in some circumstances. 

Proof: By definition of the random robustness, the 
quantum operations 

£i+R r (£i)V, and (66) 
£ 2 + R r (£ 2 )T> (67) 
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are separable quantum operations. Furthermore, since 
the composition of two separable quantum operations is 
separable, and £\ o T> — T> o £ 2 — T> o T> = T> (using 
the unitality of £\), we can compose the operations of 
Eqs. ||BBJ and @3 to see that 

Ex o £ 2 + {Rr(£i) + Rr(S 2 ) + i? r (£i)i? r (£ 2 )) V 

(68) 

is separable, and thus 

Rr(£l o £ 2 ) < Rr(£l) + Rr{£2) + Rr {£ 1 ) Rr (£ 2 ) , (69) 

as required. □ 



2. Random robustness and the threshold for quantum 
computation 

Suppose we are trying to do fault-tolerant quantum 
computation using single-qubit gates and some entan- 
gling two-qubit unitary gate, U. U might be the 
controlled-NOT gate; it can also be any other entangling 
two-qubit gate, at least in principle [23,13(1, an d still be 
capable of universal quantum computation when assisted 
by single-qubit gates. Suppose furthermore that the U 
gates are afflicted with noise of a special type, namely, 
immediately after a gate acts, each qubit is independently 
depolarized with probability p. Let U{p) = U pU^ denote 
the quantum operation corresponding to U . Then the 
quantum operation describing this noise process is 



£{p) = (l-p) 2 U(p)+ P (l-p)(V®l)oU(p) 
+p{l-p){l®V)oU{p) + p 2 {V®V) c 



U(p). 
(70) 



Note that (V®V)oU{p) = (V®V){p), so this expression 
can be simplified to 



£{ P ) = (l- P ) 2 U(p)+p(l 
+p(l-p)(T®X>) o 



-p)(V®l)oU{p) 
U{p)+p 2 {V®V){p). 



(71) 



This expression cannot immediately be analyzed using 
our expressions for the random robustness of a gate, due 
to the two terms in which a single qubit is depolarized. 
Fortunately, we can simplify the analysis by showing that 
these terms are always separability-preserving, that is, 
(£> ® J) o U and {1®V)oU are both in SP. This holds 
because for any p, (V®X)oU{p) = <S>tr AUpW , which 
is manifestly separable, and a similar result holds for (X® 
T>) o U. Note that such gates may not be separable: for 
example (X> <g> T) o swap + (J ® V) o swap is separability- 
preserving, but not separable. 

From this observation, and Eq. 1|71|1 . it follows that £ 
is in SP if (1— p) 2 U + p 2 (D ®T>) is separable. Comparing 
with the earlier results on random robustness, we see that 



this becomes true when p 2 /(l — p) 2 — R r (U) - 
see that £ will be separability-preserving when: 



P> 



R r (U) - y/RAU) _ 8 - y/8 



R r (U) - 1 
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0.74, 



We 



(72) 



and thus, when this condition is satisfied, the quantum 
computation may be efficiently simulated on a classical 
computer. If we assume, as is usually done, that quantum 
computers may not be efficiently simulated on a classical 
computer, then it follows that the threshold for quantum 
computation is guaranteed to be less than 0.74. 

In their work on obtaining upper bounds for the thresh- 
old, Aharonov and Ben-Or [ll| considered a similar 
model of quantum computation, in which each qubit is 
independently dephased after each quantum gate. The 
main difference between their model and ours is that 
we have used depolarizing, rather than dephasing noise. 
Which of these more accurately describes the noise oc- 
curring in a real physical system depends, of course, upon 
the physical system in question. Aharonov and Ben-Or 's 
obtained an upper bound of p t h < 0.97; of course, this 
cannot be directly compared to our upper bound, since 
the noise models are different. 



B. Robustness against more general noise 

1. Definitions and general results 

Depolarization is only one of the many kinds of noise 
that can afflict a quantum gate. Other classes of noise 
motivate other measures of robustness for quantum gates. 
We now introduce two more measures of robustness, 
based on two particularly natural classes of noise. The 
first measure is the separable robustness, which mea- 
sures the resilience of the gate against separable noise. 
The separable robustness, R s (£), is defined to be the 
minimum relative robustness R{£ \\!F) over all separa- 
ble, trace-preserving quantum operations T . The second 
measure is the global robustness, which measures the re- 
silience of the gate against arbitrary noise. The global 
robustness, R g {£), is defined to be the minimum relative 
robustness R(£\\T) over all trace-preserving quantum op- 
erations T . 

A priori, it is apparent that R g {£) < R s {£), but it 
is not clear whether or not the two quantities are equal. 
Furthermore, the gate robustnesses may be related to 
state robustness by the following inequalities: 



R(p(£)) < R s (£), 
R(p(U)) < R g {U). 



(73) 
(74) 



To see the first of these inequalities, note that £+R s (£)!F 
is separable, for some separable quantum operation T . It 
follows that 



P (£ + R S {£)T) = P {£) + R s {£)p{T) 



(75) 



12 



is a separable quantum state. Since p(J-) is separable, 
Eq. (73jl follows from the definition of R(p(£)). The proof 
of Eq. I|74(l follows similar lines, but also makes use of the 
fact, noted in Eq. 143( 1. that R g (ijj) — R{i^) for any pure 
state ip. 

Do the inequalities (|73() and l|74l) hold with equality? 
We do not know the answer to this question, but suspect 
that the answer is, in general, "no", in both cases. Our 
reasoning for this suspicion is as follows. Recall from 
Sec. Mil in particular, Theorem^! that not all separable 
states can be written as p(J^) for some separable quantum 
operation T . Recall also the construction, Eq. I|42(l . used 
in finding the separable which minimizes R(tp\\o~,p). 
Using this construction, it is not difficult to find examples 
of unitary U for which the separable state <J p (u) does not 
correspond to any trace-preserving, separable quantum 
operation, as characterized in Theorem [3] 

Fortunately, there is a large and interesting class of 
gates for which the inequalities 1(73(1 and 1(74(1 hold with 
equality. This class includes the controlled-NOT and 
SWAP gates. 

Theorem 5 Let U be a bipartite unitary gate acting on 
systems A and B with dimensions d& and ds- Assume 
U has Schmidt decomposition 

3 

where the Aj satisfy AjA^ — I jd^ and the Bj satisfy 

BjBj = I/ds- That is, the Aj and Bj are all propor- 
tional to unitary operators. Then 



R g {U) = R S {U) = R(p(U)) 



dAdi 



1. (77) 



Furthermore, the quantum operation T defined by 
Efay u kui (A k ® B{) p (a\ ® B\) 

Hp) = — ^ ^ L (78) 

is an instance of the type of noise against which U is least 
robust. That is, T is trace-preserving, and U + R(p(U))J r 
is separable. Finally, T is manifestly separable. Indeed, 
T can be implemented by local operations and classical 
communication. 

The application of the theorem of most interest for us is 
the CNOT. It is not necessarily obvious that the CNOT has 
a Schmidt decomposition with the properties required by 
the theorem; after all, we earlier wrote the Schmidt de- 
composition for the CNOT as \/2|0)(0|(8)//v / 2+\/2|l)(l|<8) 
and this is not of the required form. However, 
while the Schmidt co-efficients are unique, the opera- 
tors appearing in the Schmidt decomposition may not 
be unique, when two or more of the co-efficients are de- 
generate. It turns out that there is an alternative form of 



the Schmidt decomposition for the CNOT which is of the 
right form. This follows, for example, from Proposition 4 
of |ll|, and can also be verified directly, with a little al- 
gebra. The explicit form is not particularly illuminating, 
so we omit it here. 

Eq. (|77|) now tells us that _R ff (CNOT) = i? s (CNOT) = 1. 
Comparing with the random robustness, i? r (CNOT) = 8, 
we see that the CNOT is substantially less robust against 
general noise than depolarizing noise. From Eq. (|78|l we 
deduce that the quantum operation 



Hp) = (|o)<o|®i)p(|o)<o|®J) 

+ (|l)(l|®X)p(|l)(l|®X) 



(79) 



is an example of a noise process such that CNOT + T is 
a separable quantum operation. Note, furthermore, that 
T can be implemented via local operations and classical 
communication, by measuring the first qubit, and then 
conditionally applying / or X to the second qubit, de- 
pending upon whether the outcome of the measurement 
was zero or one. It is interesting that T thus corresponds 
to a classical CNOT operation. 

Proof: We already know that R(p(U)) < R g (U) < 
R S (U), so it suffices to prove that R S (U) < R(p(U)). To 
prove this, we use the construction of Vidal and Tarrach, 
Eq. (|4*2*)l . to see that p(U) + R(p(U))a p ^ is separable, 
where 

a (p( U )) = o/i,^ ^TOlfc)(fc] ®\l)(l\; (80) 



R{ P {U)) 



k=£l 



\k) = {I RA ®A h )\a); \l) = {B l ®I RB )\f3). (81) 

Using the fact that the Ak and Bi are proportional to 
unitary operations, a calculation shows that tTAB {o~p(U) ) 
is a completely mixed, separable state. By Theorem y] we 
conclude that there exists a trace-preserving, separable 
quantum operation J- such that p(T) — o~ p (jj) . (Another 
way of seeing this is to directly verify that f as defined 
by Eq. (7BJ) satisfies p(T) = o- p (u)). Thus 

p(U) + R{p(U)) P {T) =p[U + R{ P {U))T] (82) 

is separable, whence U + R(p(U))J- is separable. It fol- 
lows from the definition that R S (U) < R(p(U)), which 
completes the proof. □ 
It is not difficult to verify that R s (£) and R g {£) sat- 
isfy properties similar to those satisfied by the random 
robustness, and thus can be regarded as measures of dy- 
namic strength. The major difference is continuity: the 
lack of an explicit formula for the separable and global 
robustness has prevented us from obtaining quantitative 
continuity statements like those we obtained for the ran- 
dom robustness, although it is still not difficult to argue 
that both quantities are continuous. 



2. General robustness and the threshold for quantum 
computation 

As with the random robustness, we can use R s and 
R„ to obtain bounds on the threshold for quantum com- 



13 



putation. The method for obtaining a bound is similar. 
Suppose we have a quantum computer capable of arbi- 
trary single-qubit gates and a single two-qubit gate, U. 
Then there exists £ such that U + R g {U)£ is separable. 
Now we choose the following noise model: whenever we 
apply U , there is probability p that instead £ occurs. If 
P > Rg{U)/(l + R g {U)) then this set of operations can 
be efficiently simulated classically, and we therefore con- 
clude that pth < R g (U)/(l + R g (U)). Similar remarks 
apply for R S (U), only the noise in that case is restricted 
to be separable. 

Note that both these noise models are more adversar- 
ial, or pessimistic, than the noise model in Sec. IV A 21 
and the threshold bounds are thus tighter. In particu- 
lar, these models allow correlated two-qubit noise, while 
the earlier model assumes independent noise on the two 
qubits. Which model is more realistic obviously depends 
upon which system a gate is implemented in. However, 
we do expect correlated errors similar to those in the 
present models to play a role in many real-world two- 
qubit gates, due to interactions occurring during the gate. 

The bounds obtained using R s and R g are, in gen- 
eral, tighter than those obtained by studying R r , as in 
Sec. IV A2\ However, without specific formulas for R S {U) 
and Rg{U) it is difficult to derive bounds on the thresh- 
old without resorting to numerical calculation. Fortu- 
nately, if the only entangling gate available is of the form 
described by Theorem [5] then we can calculate the op- 
timal noise process, and the corresponding robustness 
R S {U) = R g {U) = R(p{U)). For example, for the CNOT, 
this gives the bound pth < 1/2 on the threshold, since 

.R ff (CNOT) = i? s (CNOT) = 1. 

An alternative approach to proving bounds on the 
threshold is provided by the following general bound on 
the robustness. The bound says, roughly, that if all two- 
qubit unitary gates are available, then without loss of 
generality the worst noise is depolarizing noise. 

Theorem 6 For any trace-preserving quantum operat. 
£, maxuR(U\\£) > maxy R r {U) = d 2 A d%/2. 

As a corollary of the theorem, we see that if all one- 
and two-qubit gates are available, but we don't make any 
assumptions about the noise, the worst possible noise 
will be depolarizing noise, T> ®T>, and the corresponding 
bound on the threshould is p t h < 8/9. 

Proof: Completely depolarizing noise can be repre- 
sented as applying a random unitary operation Vk with 
probability pk, where each Vk = V k ® V k B is a product of 
local gates and ^2pkVkpV k oc / for any density operator 

p. Thus V = EPkVk where V k {p) = V k pV k \ 

Since R is convex in the second argument and T>o£ = T> 
for any operation £, it follows that for any unitary U , 



Hon 



R{U\\V) = R(U\\V o £ ) = R [U 



(83) 



where the last equality follows from the fact that Vk is a 
product of local gates. 

Let R = m&xuR{U\\£) = max[/ R(V%U\\£ ). Then 
Eq. (JEHJl implies that R{U\\V) < EfcPfc-Ro = Ro for 
any U, so max(j R(U\\T>) < maxy R(U\\£ ) for any trace- 
preserving operation £. □ 

We conclude with a result tying our techniques more 
closely to the physical situation. Suppose we are at- 
tempting to perform quantum computation in the lab- 
oratory using a noisy gate, £ , meant to approximate an 
ideal, unitary quantum gate, U. U is known exactly, 
for it is a theoretical construct, and £ has been exper- 
imentally determined using quantum process tomogra- 
phy 0, For what values of p is it possible to 
find a trace-preserving quantum operation, Q, such that 
£ = pU + (1 — p)Q1 The answer to a generalization of 
this question is provided by the following theorem: 

Theorem 7 Let £ and T be trace-preserving quantum 
operations, and let < p < 1 . Then there exists a trace- 
preserving quantum operation Q such that £ — pT + (1 — 
p)Q if and only if the support of p{T) is contained within 
the support of p(£), and 



< 



1 



(84) 



where Ai(-) denotes the largest eigenvalue, and the in- 
verse is a generalized inverse if p{£) is not invertible. 

The theorem is a straightforward consequence of the 
following theorem, and the Jamiolkowski |33j | isomor- 
phism between states and operations. 

Theorem 8 Let p and a be density matrices, and let 
< p < 1. Then there exists a density matrix r such 
that p = pa + (1 — p)t if and only if the support of a is 
contained within the support of p, and 



P< 



1 



Ai(p-V) 



(85) 



< Y,PkR(U\\Vko£) =J2PkR(V k f U\\£) 



where Ai(-) denotes the largest eigenvalue, and the in- 
verse is a generalized inverse if p is not invertible. 

Proof: Suppose p = pa + (1 — p)r. Since a and r 
are positive, it is clear that the support of both a and 
r must be contained within the support of p. It will 
be convenient to work in the vector space corresponding 
to the support of p, so p is invertible. Since t is pos- 
itive, we have p > pa, as an operator inequality. Prc- 
and post-multiplying by p^ 1 ^ 2 gives I > pp~ 1 / 2 ap~ 1 / 2 . 
Comparing the largest eigenvalues of these two operators 
gives the desired inequality. The converse is proved by 
running the argument backward. □ 
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VI. CONCLUSION 

We have defined several measures of the robustness of 
quantum gates against the effects of noise, and used these 
measures to prove that certain noisy quantum gate sets 
can be efficiently simulated on a classical computer, even 
if the methods of fault-tolerant computation are used. 
Our results imply an upper bound on the threshold for 
quantum computation, p t h < 0.5. A key component in 
proving these results was a proof that any quantum com- 
putation involving only separable quantum gates can be 
efficiently simulated on a classical computer. Further- 
more, we have studied gate robustness as a measure of the 
strength of a quantum operation, considered as a phys- 
ical resource, and shown that robustness satisfies many 



properties such a strength measure is expected to have. 
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